Binary-class and Multi-class Chinese Textural Entailment System Description in NTCIR-9 RITE
نویسندگان
چکیده
In this paper, we describe the details of our system for NTCIR-9 RITE. We sent 3 runs for each of the four sub-tasks: CT-BC, CTMC, CS-BC, and CS-MC. Our approach to the NTCIR-9 RITE task is based on the standard supervised learning classification. We integrate available computational linguistic resources of Chinese language processing to build the system in a statistical natural language processing approach. First, we observed the training corpus and list all possible features. Second, we test the features on training data and find features that can be used to identify textual entailment. The features include surface text, semantic and syntactical information, such as POS tagging, NER tagging, and dependency relation. An automatic annotation subsystem is built to annotate the training corpus. Finally, the annotated data is used in training statistical models and build the classifier for the RITE 1 subtasks.
منابع مشابه
WUST SVM-Based System at NTCIR-9 RITE Task
ABSTRACT This paper describes our work in NTCIR-9 on RITE Binary-class (BC) subtask and Multi-class (MC) subtask in Simplified Chinese. We use classification method and SVM classifier to identify the textual entailment. We totally use thirteen statistical features as the classification features in our system. The system includes three parts: (1) Preprocessing, (2) Feature Extraction, (3) SVM Cl...
متن کاملWUST at NTCIR-10 RITE-2 Task: Multiple Feature Approach to Chinese Textual Entailment
ABSTRACT This paper describes our work in NTCIR-10 on RITE-2 Binary-class (BC) subtask and Multi-class (MC) subtask in Simplified Chinese. We construct the classification model based on support vector machine to recognize semantic inference in Chinese text pair, including entailment and non-entailment for BC subtask and forward entailment, bidirectional entailment, contradiction and independenc...
متن کاملNWNU Minimum Information Recognizing Entailment System for NTCIR-11 RITE-3 Task
This paper describes our work in NTCIR-11 on RITE-3 Binary-class (BC) subtask and Multi-class (MC) subtask in Simplified Chinese. We proposed a textual entailment system using a hybrid approach that integrates many features. The performance of the proposed method in the formal run achieved Macro-F1’s of 59.71% in BC subtask and only 23.19% in MC subtask
متن کاملBinary-class and Multi-class based Textual Entailment System
The article presents the experiments carried out as part of the participation in Recognizing Inference in TExt (RITE-2) @NTCIR10 for Japanese. RITE-2 has four subtasks Binary-class (BC) subtask for Japanese and Chinese, Multi-class (MC) subtask for Japanese and Chinese, Entrance Exam for Japanese and RITE4QA for Chinese. We have submitted three runs in BC subtask for Japanese (JA) (one run), Ch...
متن کاملNTU Textual Entailment System for NTCIR 9 RITE Task
In this paper, we propose a system to deal with the Chinese textual entailment problem for NTCIR-9 RITE task. The RITE task consists of four subtasks, simplified Chinese binary classification (CS_BC), simplified Chinese multi-way classification (CS_MC), traditional Chinese binary classification (CT_BC), and traditional Chinese multi-way classification (CT_MC). According to the definitions of th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011